Integer programming based heterogeneous CPU-GPU cluster schedulers for SLURM resource manager

نویسندگان

  • Seren Soner
  • Can C. Özturan
چکیده

We present two integer programming based heterogeneous CPU-GPU cluster schedulers, called IPSCHED and AUCSCHED, for the widely used SLURM resource manager. Our scheduler algorithms take windows of jobs and solve allocation problems in which free CPU cores and GPU cards are allocated collectively to jobs so as to maximize some objective functions. Our AUCSCHED scheduler employs an auction based approach in which bids for contiguous blocks of resources are generated for each job. We perform realistic SLURM emulation tests using the Effective System Performance (ESP) and our own synthetic workloads. Even though it is difficult to generalize, the tests roughly show that out of the three scheduling plugins, AUCSCHED achieves better utilization, spread and packing, IPSCHED achieves better waiting time and SLURM Backfill achieves better fragmentation performances when compared with each other. The SLURM scheduler plug-ins that implement our algorithm are available at http://code.google.com/p/slurm-ipsched/.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Co-Allocation Based Scheduling For Parallel Systems

State-of-the-art supercomputers are made up of multiple types of resources. User jobs also have wide spectrum of resource requirements. Hence, a supercomputer can be thought of as a collection of heterogeneous resources with heterogeneous usage requirements from the users. Schedulers for such systems are challenged by several issues like scalability, GPU, topology and energy awareness. We view ...

متن کامل

An OpenMP Programming Toolkit for Hybrid CPU/GPU Clusters Based on Software Unified Memory

Recently, hybrid CPU/GPU cluster has drawn much attention from the researchers of high performance computing because of amazing energy efficiency and adaptable resource exploitation. However, the programming of hybrid CPU/GPU clusters is very complex because it requires users to learn new programming interfaces such as CUDA and OpenCL, and combine them with MPI and OpenMP. To address this probl...

متن کامل

A novel cooperative accelerated parallel two-list algorithm for solving the subset-sum problem on a hybrid CPU-GPU cluster

Many parallel algorithms have recently been developed to accelerate solving the subset-sum problem on a heterogeneous CPU–GPU system. However, within each compute node, only one CPU core is used to control one GPU and all the remaining CPU cores are in idle state, which leads to a large number of CPU cores being wasted. In this paper, based on a cost-optimal parallel two-list algorithm, we prop...

متن کامل

A Study of Scheduling a Neuro - imaging Application On a Heterogeneous CPU - GPU Cluster by Reza Nakhjavani

A Study of Scheduling a Neuro-imaging Application On a Heterogeneous CPU-GPU Cluster Reza Nakhjavani Master of Applied Science Graduate Department of Electrical and Computer Engineering University of Toronto 2014 The ever increasing complexity of scientific applications has led to utilization of new HPC paradigms such as Graphical Processing Units (GPUs). However, modifying applications to run ...

متن کامل

SLURM: Simple Linux Utility for Resource Management

Simple Linux Utility for Resource Management (SLURM) is an open source, faulttolerant, and highly scalable cluster management and job scheduling system for Linux clusters of thousands of nodes. Components include machine status, partition management, job management, scheduling, and stream copy modules. This paper presents an overview of the SLURM architecture and functionality. 1 Overview Simpl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Comput. Syst. Sci.

دوره 81  شماره 

صفحات  -

تاریخ انتشار 2015